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The authors study the short-time dynamics of helix-forming polypeptide chains 
using an all-atom representation of the molecules and an implicit solvation model 
to approximate the interaction with the surrounding solvent. The results confirm 
earlier observations that the helix-coil transition in proteins can be described by a 
set of critical exponents. The high statistics of the simulations allows the authors to 
determine the exponents values with increased precision and support universality of 
the helix-coil transition in homopolymers and (helical) proteins. 

I. INTRODUCTION 

While energy landscape theory and funnel concept 1,2 are powerful tool for describing the 
general characteristics of folding, many details remain to be solved. One example is the 
nature of the transitions between different thermodynamic states that are observed during 
folding of proteins. Their nature and role in the folding process are active areas of research 
as many diseases are associated with spontaneous or induced misfolding of proteins. In prin- 
ciple, these transitions can be probed in silico. However, the energy landscape of proteins is 
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characterized by a rough energy landscape with a huge number of local minima separated 
by high energy barriers. Considerable effort has been put into the development of sampling 
techniques that overcome the resulting slow convergence of computer experiments 3 ' 4 . In 
generalized-ensemble techniques 5 such as parallel tempering 6 , the broad-histogram method 7 
or multicanonical sampling 8 , simulations are performed in an artificial ensemble where the 
convergence to the equilibrium distribution is fast. Thermodynamic quantities are then cal- 
culated through re- weighting to the canonical (i.e., physical) ensemble. In Ref. 9 we applied a 
complimentary approach to protein science that leaves the "physical" dynamics unchanged. 
The underlying idea is to extract information on certain equilibrium properties from the crit- 
ical relaxation of regular canonical Monte Carlo simulations (i.e., a non-equilibrium behavior 
of the system). 

Here, we revisit this approach and study with much improved statistics the helix-coil 
transition in simple polypeptides. Since it allows us to compare our results with earlier work, 
we study in the present paper the nonequilibrium evolution process of (Ala) n chains (N = 
10, 20, 30 and 40) in short-time Monte Carlo (MC) simulations. Our investigation is later 
extended towards the 34-residue human parathyroid fragment PTH(l-34). We evaluate and 
compare for both systems the exponents that characterize the observed helix-coil transition. 



II. METHODS 

Janssen et a/. 10 ' 11 have shown that at criticality universality and scaling are already 
present early in the evolution of systems. A system, characterized by an order parameter 
m, decays at a critical temperature T = T c from an initially ordered state m = 1 (after 
a microscopic time t m j C ) according to a power-law 12 ' 13 ' 14 m(t) ~ t~$l vz . This relation is 
obtained from the more general scaling form: 

m(t, r, N) = b~P /u m{b- z t, b l/u r, b~ l N), (1) 

where b is an arbitrary chain size scaling factor and r is the reduced temperature, r = 
(T — T c )/T c . Differentiating ]nm(t, r) with respect to the temperature T at T = T c , 



d\nm(t, r, L) 



~ t 1/uz , (2) 

r=0 



Or 

leads to a scaling relation that allows one to extract the exponent 15 l/vz . Combining 
the two scaling relations allows one further to eliminate the dynamic exponent z and to 
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calculate the estimates of the critical exponent (3. Additional information on the system can 
be obtained from the scaling 16 of the second cumulant 

We use in the present paper Eq. 3 in combination with Eq. 2 to obtain estimates of the 
exponent dv. 

Another dynamic critical exponent was proposed by Majumdar et al. 17 . The authors 
studied global persistence probability P(t) which is defined as 

P(*) = 1-I>(0, (4) 

f=i 

where pit') is the fraction of the samples where the order parameter m has changed its sign 
for the first time at the instant t' . At criticality, P(t) is expected to decay algebraically as 

Pit) ~ r e * (5) 

where 9 g is the global persistence exponent. The study of the persistence behavior have 
attracted enormous interest, playing an important role in the study of systems far from 
equilibrium 18 ' 19 ' 20 ' 21 ' 22 ' 23 ' 24 . 

The question arises whether the above scaling laws apply also to transitions in proteins 
and other biological molecules for that - as finite systems - the concept of a phase transition 
is not defined. For instance, the formation of a-helices in proteins resembles crystallization. 
The nature of this helix-coil transition is a long-standing topic of research 25 and has been 
studied with various methods including renormalization group techniques 26 . A few years 
ago additional evidence was presented by one of us 27 ' 28 ' 29 that this helix-coil transition is 
for polyalanine (a homopolymer where in principle one can define the limes of an infinite 
chain) a true thermodynamic phase transition. Because of these previous results we decided 
to focus on helix-formation in order to study the scaling behavior of short-time dynamics in 
proteins. 

We define our order parameter as the number of helical residues qu = 
max ((2 < n H (T) > /(N — 2) — 1),0). Here we define a residue as helical if its backbone 
dihedral angles (0, ip) take values in the range 30 (—70° ± 30°, —37° ± 30°) and the residue 
is hydrogen bonded. The normalization factor N — 2 is chosen instead of N, the number 
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of residues, because the flexible terminal residues are usually not part of an a-helix. Our 
definition ensures that < q H < 1 and qh(T c ) = 0. 

Our short-time MC simulations of the helix-coil transition are based on a detailed, all- 
atom representation of proteins. The interaction between the atoms is described by a stan- 
dard force field, ECEPP/2 (Empirical Conformational Energy Program for Peptides, ver- 
sion 2) 31 as implemented in the program package SMMP (Simple Molecular Mechanics for 
Proteins) 32 : 

Eecepp/2 = E c + E LJ + E HB + E tor , (6) 

Ec = -£^, (7) 

(id) £Ti i 



Ehb - 2^ 32 - Zw > ( 9 ) 

E tor = J2 U l( 1±C0S ( n lXl))- (10) 
/ 

Here, (in A) is the distance between the atoms % and j, and xi is f ne ^-th torsion angle. 
The interactions between our polypeptides and water are approximated by means of an 
implicit water model which assumes that the solvation (free) energy is proportional to the 
solvent accessible surface area: 

E S olv = J2 a i A i' ( n ) 

i 

where E soLv is the solvation energy, A; L is the solvent accessible area of the i-th atom, and 
<ji is the corresponding solvation parameter. We have chosen the parameter set of Ref. 33 
that is often used in conjunction with the ECEPP force field. 



III. RESULTS AND DISCUSSION 



We start our investigation of the short time dynamics in the helix-coil transition by 
simulating polyalanine chains of lengths iV = 10, 20, 30, and 40. Our results are averaged over 
4000 independent runs for iV = 10, 20, 30 and 2000 runs for iV = 40. Errors are estimated 
by dividing these 4000 (2000) runs in bins of 100 (50) runs and calculating the fluctuation 
of the averages calculated for each bin. Figure la displays for various temperatures the time 



5 



series of our order parameter q h as a function of Monte Carlo time for Alai . Corresponding 
plots are displayed for Ala 2 o (Ala 30 ) in Fig. lb (lc), and for Ala 40 in Fig. Id. 

We expect to see in such log-log plot that curves of qu corresponding to low temperatures 
approach a constant value, while high-temperatures are characterized by an exponential 
decrease of nn- Both temperature regions are separated by the critical temperature T c 
where the corresponding curve is a straight line (in the scaling region). Our plots in Fig. 1 
indicate for iV = 10 as critical temperature T c 10 = 315 K, = 415 K for = 20 , T c 30 = 450 
K for iV = 30 ; and one of T c 40 = 470 K for our longest chain, Ala 40 . 

These estimates of the critical temperatures are lower than the ones presented in Ref. 34, 
where for N = 10, N = 20 and N = 30 the following temperatures were estimated from 
the position of the maximum in the specific heat: T c 10 = 333(2) K, T c 20 = 430(2) K and 
Tj?° = 461(2) K. Hence, for these chain sizes the estimates obtained from a short-time 
analysis seem to be lower by ~ 15 K than the ones estimated from the peak in specific heat. 
This does not indicate that one of the two sets is wrong. Even in spin systems the estimate 
of the critical temperature derived from the peak in the specific heat usually differs for small 
system size from the one obtained from the change in magnetization. Only for large system 
size both values converge. However, we remark that our values are calculated over a set of 
4000 (2000 for N = 40) independent runs while the results of multicanonical runs in Ref. 34 
contain only of order fx 10 independent events. 

Note that the scaling relation leads to similar results for all chain lengths. This can 
be seen in Fig. 2 which displays for T = T c the time series of the order parameter as a 
function of time for all four chain lengths. We focus here on the linear range and show both 
the measured data points and the straight line that best fits these data. The data for the 
different chain sizes are almost parallel to each other and therefore differ little in their slope. 
From the slope of these plots we can obtain an estimate of (5/ uz. 

Measuring in addition the slope for slightly higher (lower) temperatures and calculating 
the numerical derivative leads in addition to an estimate for l/vz. In Fig. 3, the power law 
increase of Eq. (2) is plotted in double-log scales for the for polyalanine chains. Estimates 
of the ratio d/z can be extracted from the scaling of U2{t,L) which is shown for the four 
polyalanine chains in Fig. 4. Table 1 lists the values of all three ratios f3/vz, l/vz and d/z, 
together with the exponents (5 and dv that can be extracted from them. 

In order to estimate the exponent 6 g one needs to prepare the initial states carefully to 
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obtain precise values of the initial g° = q h (t = 0). The exponent 9 g is obtained from Eq. 
5. Fig. 5 shows the corresponding log-log plots of P(t) for Ala m and various initial values 
of From these curves one extracts in the limes iV — > oo and g° — > an estimate for 
the persistence exponent 6 g = 0.73(1). The corresponding values for polyalanine chains of 
length N = 10,20,30 and N = 40, and for the polypeptide PTH(l-34), are also listed in 
table 1. 

Our values for the critical exponent (3 « 0.39 in table 1 clearly excludes (3 = 0, the 
value expected for a first order transition. A similar statement holds for du « 1.4 that also 
excludes a first-order transition (dv = 1). This is different from the case of polyalanine in 
gas phase where the helix-coil transition is of first-order type 28 . 

The estimates listed in table 1 indicate that the finite size effects are less than the sta- 
tistical errors and that reliable estimators for critical exponents can already be obtained on 
very small chains. This suggests that short-time analysis can also be applied to proteins 
where an extrapolation to infinite chain length is not possible. Here, short-time analysis 
opens a way to characterize thermodynamic processes in proteins by defining analogues to 
phase transitions. 

We have checked this assumption for the peptide fragment PTH(l-34) corresponding 
to residues 1-34 of human parathyroid hormone 35 ' 36 ' 37 . This fragment is sufficient for the 
biological activities of the hormone suggesting medical and pharmaceutical applications 
of the peptide. The structure of PTH(l-34) has been resolved both crystallized 36 and in 
solution 37 and is at room temperature almost completely helical. In previous work 38 using 
the same energy function it was shown that PTH(l-34) exhibits at T = 560(10) K a sharp 
transition between a high temperature region where disordered coil structures prevail, and 
a low-temperature region that is characterized by mostly helical structures. The nature of 
this helix-coil transition could not be established in Ref. 38. 

Observing the short-time dynamics of 500 runs for the helix-coil transition in PTH(l-34) 
we obtained data for temperatures T = 520, 530, 540, 545, 550 and 560 K. Fig. 6 displays 
our order parameter for various temperatures as a function of Monte Carlo time. From the 
plot we estimate that the critical temperature is T = 545(5) K, a value that is again slightly 
smaller than the obtained from the peak in specific heat. The scaling region is shown for 
this temperature in Fig. 7, with both the data point and the best straight-line fit through 
them. 
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Note that our estimate of the critical temperature differs slightly from the one (T = 540 
K) reported in Ref. 9. The slightly lower value that was obtained in this preliminary work 
with much lower statistics led to critical exponents different from the ones for polyalanine. 
Our new data indicate however that the critical exponents (listed also in table 1) are the 
same for the polyalanine chains and the polypeptide PTH(l-34). This is an interesting result 
as it indicates that the order of the helix-coil transition does not depend on the side chains 
of the residues in a protein. On one hand we have (poly)alanine with its extrem short side 
chain, on the other PTH(l-34) where the side chains differ for each residue and therefore 
side chain ordering is much more difficult to obtain than in a homopolymer. Hence, our 
new results suggest "universality" of the helix-coil transition in homopolymers and (helical) 
proteins. 

IV. SUMMARY AND CONCLUSION 

We have investigated the short-time dynamics of helix-forming polypeptides with up 
to N = 40 residues. At their critical temperature the evolution of these molecules can be 
described by scaling laws that allow on to extract various critical exponents that characterize 
the helix-coil transition in these molecules. Our results indicate "universality" of the helix- 
coil transition in protein-like molecules. They also demonstrate that the analysis of short- 
time dynamics is a valuable tool for the investigation of transitions in proteins that may 
lead to a deeper understanding of their folding mechanism. 
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Tables 



N 

1 V 


ft Iu7 
PI UZ 


1 Iu7 


dl 7 


ft 


Lit/ 


°g 


10 


0.303(3) 


0.772(7) 


1.099(8) 


0.392(7) 


1.42(3) 


0.402(6) 


20 


0.304(3) 


0.782(9) 


1.100(9) 


0.389(8) 


1.40(3) 


0.687(7) 


30 


0.301(5) 


0.779(9) 


1.09(1) 


0.39(1) 


1.40(3) 


0.708(7) 


40 


0.299(5) 


0.78(1) 


1.09(1) 


0.38(1) 


1.42(4) 


0.710(8) 


PTH 


0.290(5) 


0. 768(8) 


1.09(1) 


0.38(1) 


1.42(3) 


0.705(6) 



TABLE I: Exponents as obtained for polyalanine chains of length iV = 10, 20 and 40 from the 
scaling relations of Eqs. 1, 2, 3 and 5. The columns 5 and 6 list the critical exponents (5 and 
dv as calculated from these quantities. The last row summarize the results for the polypeptide 
PTH(l-34). 
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Figures 

Fig. 1 Log-log plot of the time series of the helical order parameter q H (defined in the text) 
as a function of Monte Carlo time for polyalanine chains of length (a) N — 10, (b) 
N = 20, (c) N = 30; and (d) N = 40. 

Fig. 2 Log-log plot of the time series of the helical order parameter q H as a function of 
Monte Carlo time for polyalanine chains of length N = 10, 20, 30 and 40 as measured 
at their respective critical temperatures. Shown are the data in the scaling region and 
the best fit through them. 

Fig. 3 Log-log plot of the time series of the logarithmic derivative of the order parameter 
(Eq. 2) as a function of Monte Carlo time for polyalanine chains of length N = 
10, 20, 30 and 40, and for the polypeptide PTH(l-34). 

Fig. 4 Log- log plot of the time series of the second moment U 2 (t, L) as a function of Monte 
Carlo time for polyalanine chains of length N = 10, 20, 30 and 40 and the polypeptide 
PTH(l-34) as measured at their respective critical temperatures. Shown are the data 
in the scaling region and the best fit through them. 

Fig. 5 Log-log plot of the time series of the global persistency P(t) (defined in the text) 
for Ala^o and various initial order parameter values q®. 

Fig. 6 Log-log plot of the time series of the helical order parameter q H as a function of 
Monte Carlo time for the polypeptide PTH(l-34) measured at temperatures T = 
520, 530, 540, 545, 550 and 560 K. 

Fig. 7 Log-log plot of the time series of the helical order parameter q H as a function of 
Monte Carlo time for the polypeptide PTH(l-34) at the critical temperature T c = 545 
in the scaling region, with the best fit also drawn through them. 
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